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© Fusion proteins containing N-terminal fragments of human serum albumin. 



© A fusion polypeptide comprising, as at least part of the N-terminal portion thereof, an N-terminal portion of 
HSA or a variant thereof and, as at least part of the C-terminai portion thereof, another polypeptide except that, 
when the said N-terminal portion of HSA is the 1-n portion where n is 369 to 419 or a variant thereof then the 
said polypeptide is one of various specified entities, including the 585 to 1578 portion of human fibronectin or a 
variant thereof. 

The HSA-like portion may have additional N-terminal residues, such as secretion leader sequences (signal 
sequences). The C-terminal portion is preferably the 585-1578 portion of human plasma fibronectin. The N- 
terminal and C-terminal portions may be cleavable to yield the isolated C-terminal portion, with the N-terminal 
portion having served to facilitate secretion from the host 
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POLYPEPTIDES 



The present invention relates to fusion polypeptides where two individual polypeptides or parts thereof 
are fused to form a single amino acid chain. Such fusion may arise from the expression of a single 
continuous coding sequence formed by recombinant DNA techniques. 

Fusion polypeptides are known, for example those where a polypeptide which is the ultimately desired 
5 product of the process is expressed with an N-terminal "leader sequence" which encourages or allows 
secretion of the polypeptide from the cell. An example is disclosed in EP-A-116 201 (Chiron). 

Human serum albumin (HSA) is a known protein found in the blood. EP-A-147 198 (Delta Biotechnol- 
ogy) discloses its expression in a transformed host, in this case yeast. Our earlier application EP-A-322 094 
discloses N-terminal fragments of HSA, namely those consisting of residues 1-n where n is 369 to 419, 
10 which have therapeutic utility. The application also mentions the possibility of fusing the C-terminal residue 
of such molecules to other, unnamed, polypeptides. 

One aspect of the present invention provides a fusion polypeptide comprising, as at least part of the N- 
terminal portion thereof, an N-terminal portion of HSA or a variant thereof and, as at least part of the O 
terminal portion thereof, another polypeptide except that, when the said N-terminal portion of HSA is the 1-n 
75 portion where n is 369 to 419 or a variant thereof then the said polypeptide is (a) the 585 to 1578 portion of 
human fibronectin or a variant thereof, (b) the 1 to 368 portion of CD4 or a variant thereof, (c) platelet 
derived growth factor, or a variant thereof, (d) transforming growth factor, or a variant thereof, (e) the 1-261 
portion of mature human plasma fibronectin or a variant thereof, (f) the 278-578 portion of mature human 
plasma fibronectin or a variant thereof, (g) the 1-272 portion of mature human von Willebrand's Factor or a 
20 variant thereof, or (h) alpha- 1 -antitrypsin or a variant thereof. 

The N-terminal portion of HSA is preferably the said 1-n portion, the 1-177 portion (up to and including 
the cysteine), the 1-200 portion (up to but excluding the cysteine) or a portion intermediate 1-177 and 1- 
200. 

The term "human serum albumin" (HSA) is intended to include (but not necessarily to be restricted to) 

25 known or yet-to-be-discovered polymorphic forms of HSA. For example, albumin Naskapi has Lys-372 in 
place of GIu-372 and pro-albumin Christchurch has an altered pro-sequence. The term "variants" is 
intended to include (but not necessarily to be restricted to) minor artificial variations in sequence (such as 
molecules lacking one or a few residues, having conservative substitutions or minor insertions of residues, 
or having minor variations of amino acid structure). Thus polypeptides which have 80%, preferably 85%, 

30 90%, 95% or 99%, homology with HSA are deemed to be "variants". It is also preferred for such variants to 
be physiologically equivalent to HSA; that is to say, variants preferably share at least one pharmacological 
utility with HSA. Furthermore, any putative variant which is to be used pharmacologically should be non- 
immunogenic in the animal (especially human) being treated. 

Conservative substitutions are those where one or more amino acids are substituted for others having 

35 similar properties such that one skilled in the art of polypeptide chemistry would expect at least the 
secondary structure, and preferably the tertiary structure, of the polypeptide to be substantially unchanged. 
For example, typical such substitutions include asparagine for glutamine, serine for asparagine and arginine 
for lysine. Variants may alternatively, or as well, lack up to ten (preferably only one or two) intermediate 
amino acid residues (ie not at the termini of the said N-terminal portion of HSA) in comparison with the 

40 corresponding portion of natural HSA; preferably any such omissions occur in the 100 to 369 portion of the 
molecule (relative to mature HSA itself) (if present). Similarly, up to ten, but preferably only one or two, 
amino acids may be added, again in the 100 to 369 portion for preference (if present). The term 
"physiologically functional equivalents" also encompasses larger molecules comprising the said sequence 
plus a further sequence at the N-terminal (for example, pro-HSA, pre-pro-HSA and met-HSA). 

45 Clearly, the said "another polypeptide" in the fusion compounds of the invention cannot be the 
remaining portion of HSA, since otherwise the whole polypeptide would be HSA, which would not then be a 
"fusion polypeptide". 

Even when the HSA-like portion is not the said 1-n portion of HSA, it is preferred for the non-HSA 
portion to be one of the said (a) to (h) entities. 
so The 1 to 368 portion of CD4 represents the first four disulphide-linked immunoglobulin-like domains of 
the human T lymphocyte CD4 protein, the gene for and amino acid sequence of which are disclosed in D. 
Smith et al (1987) Science 328, 1704-1707. It is used to combat HIV infections. 

The sequence of human platelet-derived growth factor (PDGF) is described in Collins et al (1985) 
Nature 316 , 748-750. Similarly, the sequence of transforming growth factors j3 (TGF-/3) is described in 
Derynck et al (1985) Nature 316, 701-705. These growth factors are useful for wound-healing. 
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A cDNA sequence for the 1-261 portion of Fn was disclosed in EP-A-207 751 (obtained from plasmid 
pFH6 with endonuclease Pvull). This portion binds fibrin and can be used to direct fused compounds to 
blood clots. ~ — 

A cDNA sequence for the 278-578 portion of Fn, which contains a collagen-binding domain, was 
5 disclosed by R.J. Owens and F.E. Baralle in 1986 E.M. B.O.J. 5, 2825-2830. This portion will bind to 
platelets. 

The 1-272 portion of von Willebrand f s Factor binds and stabilises factor VIII. The sequence is given in 
Bontham et al, Nucl. Acids Res. 14, 7125-7127. 

Variants of alpha-1 -antitrypsin include those disclosed by Rosenburg et al (1984) Nature 312 , 77-80. In 

10 particular, the present invention includes the Pittsburgh variant (Met 358 is mutated to Arg) and the variant 
where Pro 357 and Met 358 are mutated to alanine and arginine respectively. These compounds are useful in 
the treatment of septic shock and lung disorders. 

Variants of the non-HSA portion of the polypeptides of the invention include variations as discussed 
above in relation to the HSA portion, including those with conservative amino acid substitutions, and also 

15 homologues from other species. 

The fusion polypeptides of the invention may have N-terminal amino acids which extend beyond the 
portion corresponding to the N-terminal portion of HSA. For example, if the HSA-like portion corresponds to 
an N-terminal portion of mature HSA, then pre-, pro-, or pre-pro sequences may be added thereto, for 
example the yeast alpha-factor leader sequence. The fused leader portions of WO 90/01063 may be used. 

20 The polypeptide which is fused to the HSA portion may be a naturally-occurring polypeptide, a fragment 
thereof or a novel polypeptide, including a fusion polypeptide. For example, in Example 3 below, a fragment 
of fibronectin is fused to the HSA portion via a 4 amino acid linker. 

It has been found that the amino terminal portion of the HSA molecule is so structured as to favour 
particularly efficient translocation and export of the fusion compounds of the invention in eukaryotic cells. 

25 A second aspect of the invention provides a transformed host having a nucleotide sequence so 
arranged as to express a fusion polypeptide as described above. By "so arranged", we mean, for example, 
that the nucleotide sequence is in correct reading frame with an appropriate RNA polymerase binding site 
and translation start sequence and is under the control of a suitable promoter. The promoter may be 
homologous with or heterologous to the host. Downstream (3 ) regulatory sequences may be included if 

30 desired, as is known. The host is preferably yeast (for example Saccharomyces spp., e.g. S. cerevisiae ; 
Kluyveromyces spp., e.g. K. lactis; Pichia spp.; or Schizosaccharomyces spp., e.g. S. pombe ) but may be 
any other suitable host such as E. coli, B. subtilis , "Aspergillus spp., mammalian cells, plant cells or insect 
cells. 

A third aspect of the invention provides a process for preparing a fusion polypeptide according to the 

35 first aspect of the invention by cultivation of a transformed host according to the second aspect of the 
invention, followed by separation of the fusion polypeptide in a useful form. 

A fourth aspect of the invention provides therapeutic methods of treatment of the human or other animal 
body comprising administration of such a fusion polypeptide. 

In the methods of the invention we are particularly concerned to improve the efficiency of secretion of 

40 useful therapeutic human proteins from yeast and have conceived the idea of fusing to amino-terminai 
portions of HSA those proteins which may ordinarily be only inefficiently secreted. One such protein is a 
potentially valuable wound-healing polypeptide representing amino acids 585 to 1578 of human fibronectin 
(referred to herein as Fn 585-1578). As we have described in a separate application (filed simultaneously 
herewith) this molecule contains cell spreading, chemotactic and chemokinetic activities useful in healing 

45 wounds. The fusion polypeptides of the present invention wherein the C-terminal portion is Fn 585-1578 can 
be used for wound healing applications as biosynthesised, especially where the hybrid human protein will 
be topically applied. However, the portion representing amino acids 585 to 1578 of human fibronectin can if 
desired be recovered from the fusion protein by preceding the first amino acid of the fibronectin portion by 
amino acids comprising a factor X cleavage site. After isolation of the fusion protein from culture 

50 supernatant, the desired molecule is released by factor X cleavage and purified by suitable chromatography 
(e.g. ion-exchange chromatography). Other sites providing for enzymatic or chemical cleavage can be 
provided, either by appropriate juxtaposition of the N-terminal and C-terminal portions or by the insertion 
therebetween of an appropriate linker. 

At least some of the fusion polypeptides of the invention, especially those including the said CD4 and 

55 vWF fragments, PDGF and ai AT, also have an increased half-life in the blood and therefore have 
advantages and therapeutic utilities themselves, namely the therapeutic utility of the non-HSA portion of the 
molecule. In the case of aiAT and others, the compound will normally be administered as a one-off dose or 
only a few doses over a short period, rather than over a long period, and therefore the compounds are less 
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likely to cause an immune response. 



EXAMPLES : SUMMARY 

5 

Standard recombinant DNA procedures were as described by Maniatis et al (1982 and recent 2nd 
edition) unless otherwise stated. Construction and analysis of phage M13 recombinant clones was as 
described by Messing (1983) and Sanger et al (1977). 

DNA sequences encoding portions of human serum albumin used in the construction of the following 
10 molecules are derived from the plasmids mHOB12 and pDBD2 (EP-A-322 094, Delta Biotechnology Ltd, 
relevant portions of which are reproduced below) or by synthesis of oligonucleotides equivalent to parts of 
this sequence. DNA sequences encoding portions of human fibronectin are derived from the plasmid 
pFHDELI, or by synthesis of oligonucleotides equivalent to parts of this sequence. Plasmid pFHDELI, 
which contains the complete human cDNA encoding plasma fibronectin, was obtained by ligation of DNA 
75 derived from plasmids pFH6, 16, 54, 154 and 1 (EP-A-207 751; Delta Biotechnology Ltd). 

This DNA represents an mRNA variant which does not contain the 'ED 1 sequence and had an 89-amino 
acid variant of the Hl-CS region (R.J. Owens, A.R. Kornblihtt and F.E. Baralle (1986) Oxford Surveys on 
Eukaryotic Genes 3 141-160). The map of this vector is disclosed in Fig. 11 and the protein sequence of 
the mature polypeptide produced by expression of this cDNA is shown in Fig. 5. 
20 Oligonucleotides were synthesised on an Applied Biosystems 380B oligonucleotide synthesiser accord- 
ing to the manufacturer's recommendations (Applied Biosystems, Warrington, Cheshire, UK). 

An expression vector was constructed in which DNA encoding the HSA secretion signal and mature 
HSA up to and including the 387th amino acid, leucine, fused in frame to DNA encoding a segment of 
human fibronectin representing amino acids 585 to 1578 inclusive, was placed downstream of the hybrid 
25 promoter of EP-A-258 067 (Delta Biotechnology), which is a highly efficient galactose-inducible promoter 
functional in Saccharomyces cerevisiae. The codon for the 1578th amino acid of human fibronectin was 
directly followed by a stop codon (TAA) and then the S. cerevisiae phosphogiycerate kinase (PGK) gene 
transcription terminator. This vector was then introduced into S. cerevisiae by transformation, wherein it 
directed the expression and secretion from the cells of a hybricTmolecule representing the N-terminal 387 
30 amino acids of HSA C-terminally fused to amino acids 585 to 1578 of human fibronectin. 

In a second example a similar vector is constructed so as to enable secretion by S. cerevisiae of a 
hybrid molecule representing the N-terminal 195 amino acids of HSA C-terminally fused toamino acids 585 
to 1578 of human fibronectin. 

Aspects of the present invention will now be described by way of example and with reference to the 
35 accompanying drawings, in which: 

Figure 1 (on two sheets) depicts the amino acid sequence currently thought to be the most 
representative of natural HSA, with (boxed) the alternative C-termini of HSA(1-n); 

Figure 2 (on two sheets) depicts the DNA sequence coding for mature HSA, wherein the sequence 
included in Linker 3 is underlined; 
40 Figure 3 illustrates, diagrammatically, the construction of mHOB16; 

Figure 4 illustrates, diagrammatically, the construction of pHOB31 ; 

Figure 5 (on 6 sheets) illustrates the mature protein sequence encoded by the Fn plasmid pFHDELI; 
Figure 6 illustrates Linker 5, showing the eight constituent oligonucleotides; 
Figure 7 shows schematically the construction of plasmid pDBDF2; 
45 Figure 8 shows schematically the construction of plasmid pDBDFS; 

Figure 9 shows schematically the construction of plasmid pDBDF9; 

Figure 10 shows schematically the construction of plasmid DBDF12, using plasmid pFHDELI; and 
Figure 11 shows a map of plasmid pFHDELI. 

50 

EXAMPLE 1 : HSA 1-387 FUSED TO Fn 585-1578 

The following is an account of a preparation of plasmids comprising sequences encoding a portion of 
HSA, as is disclosed in EP-A-322 094. 
55 The human serum albumin coding sequence used in the construction of the following molecules is 
derived from the plasmid M13mp19.7 (EP-A-201 239, Delta Biotech- nology Ltd.) or by synthesis of 
oligonucleotides equivalent to parts of this sequence. Oligonucleotides were synthesised using 
phosphoramidite chemistry on an Applied Biosystems 380B oligonucleotide synthesizer according to the 
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manufacturer's recommendations (AB Inc., Warrington, Cheshire, England). 

An oligonucleotide was synthesised (Linker A) which represented a part of the known HSA coding 
sequence (Figure 2) from the Pstl site (1235-1240, Figure 2) to the codon for valine 381 wherein that codon 
was changed from GTG to GTCT 



Linker 1 
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5' 




GAT 


CCT CAT 


GAA 


TGC 


TAT 


3' ACGT 


CTA 


GGA GTA 


CTT 


ACG 


ATA 
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AAA 


GTG 


TTC GAT 
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TTT 


AAA 


CGG 


TTT 


CAC 


AAG CTA 


CTT 


AAA 


TTT 






1267 
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V 










CTT 


GTC 


3' 










GGA 


CAG 


5' 
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Linker 1 was ligated into the vector M13mp19 (Norrander et al, 1983) which had been digested with Pstl 
and Hindi and the ligation mixture was used to transfect E.coli strain XL1-B!ue (Stratagene Cloning 

40 Systems, San Diego, CA). Recombinant clones were identified by their failure to evolve a blue colour on 
medium containing the chromogenic indicator X-gal (5-bromo-4-chloro-3-indolyl-$-D-galactoside) in the 
present of IPTG (isopropylthio-jfl-galactoside). DNA sequence analysis of template DNA prepared from 
bacteriophage particles of recombinant clones identified a molecule with the required DNA sequence, 
designated mHOB12 (Figure 3). M13mp19.7 consists of the coding region of mature HSA in M13mp19 

45 (Norrander et al, 1983) such that the codon for the first amino acid of HSA, GAT, overlaps a unique Xho i 
site thus: ~ ~~ 



Asp Ala 
CTCGAGATGCA 3' 
GAGCTCTACGT 5' 
Xho i 



50 

5' 
3' 

55 
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(EP-A-210 239). Ml3mp19.7 was digested with Xhol and made flush-ended by S1 -nuclease treatment and 
was then ligated with the following oligonucleotide (Linker 2): 



Linker 2 



5'TCTTTTATCCAAGCTTGGATAAAAGA 3, 

3'AGAAAATAGGTTCGAACCTATTTTCT 5' 

Hind i I I 

The ligation mix was then used to transfect E.coli XL1-Blue and template DNA was prepared from 
several plaques and then analysed by DNA sequencing to identify a clone, pDBD1 (Figure 4), with the 
correct sequence. 

A 1 .1 kb Hindlll to Pstl fragment representing the 5 end of the HSA coding region and one haif of the 
inserted oligonucleotide linker was isolated from pDBD1 by agarose gel electrophoresis. This fragment was 
then ligated with double stranded mHOB12 previously digested with Hindlll and Pstl and the ligation mix 
was then used to transfect E.coli XL1-Blue. Single stranded template DNA was prepared from mature 
bacteriophage particles of several plaques. The DNA was made double stranded in vitro by extension from 
annealed sequencing primer with the Klenow fragment of DNA polymerase I in the presence of deox- 
ynucleoside triphosphates. Restriction enzyme analysis of this DNA permitted the identification of a clone 
with the correct configuration, mHOB15 (Figure 4). 

The following oligonucleotide (Linker 3) represents from the codon for the 382nd amino acid of mature 
HSA (glutamate, GAA) to the codon for lysine 389 which is followed by a stop codon (TAA) and a Hindlll 
site and then a Bam HI cohesive end: — 

Linker 3 

EEPQNLIKJ 
5 ' GAA GAG CCT CAG AAT TTA ATC AAA TAA GCTTG 3 ' 

3' CTT CTC GGA GTC TTA AAT TAG TTT ATT CGAACCTAG 5' 



This was ligated into double stranded mHOB15, previously digested with Hindi and BamHI. After 
45 ligation, the DNA was digested with Hindi to destroy all non-recombinant mole"cu1es and then used to 
transfect E.coli XL1-Blue. Single stranded DNA was prepared from bacteriophage particles of a number of 
clones and subjected to DNA sequence analysis. One clone having the correct DNA sequence was 
designated mHOB16 (Figure 4). 

A molecule in which the mature HSA coding region was fused to the HSA secretion signal was created 
50 by insertion of Linker 4 into Bam HI and Xho l digested M13mp19.7 to form pDBD2 (Figure 4). 

Linker 4 

55 
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M K W V S F 

5' GATCC ATG AAG TGG GTA AGC TTT 

5 

G TAC TTC ACC CAT TCG AAA 

10 I SLLFLFS 

ATT TCC CTT CTT TTT CTC TTT AGC 



TAA AGG GAA GAA AAA GAG AAA TCG 



75 
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G 
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TCG 


GCT 


TAT 


TCC 


AGG 


GGT 


GTG 


TTT 


AGC 


CGA 


ATA 


AGG 


TCC 


CCA 


CAC 


AAA 



25 



R R 
CG 3' 
GCAGCT 5 ' 



35 In this linker the codon for the fourth amino acid after the initial methionine, ACC for threonine in the 
HSA pre-pro leader sequence (Lawn et al, 1981), has been changed to AGC for serine to create a Hindlll 
site. ~ ~~ 

A sequence of synthetic DNA representing a part of the known HSA coding sequence (Lawn et al M 
1981) (amino acids 382 to 387, Fig. 2), fused to part of the known fibronectin coding sequence (KornblThtfet 

40 al., 1985) (amino acids 585 to 640, Fig. 2), was prepared by synthesising six oligonucleotides (Linker 5, Fig. 

6) . The oligonucleotides 2, 3, 4, 6, 7 and 8 were phosphorylated using T4 polynucleotide kinase and then 
the oligonucleotides were annealed under standard conditions in pairs, i.e. 1+8, 2 + 7, 3 + 6 and 4 + 5. The 
annealed oligonucleotides were then mixed together and ligated with mHOB12 which had previously been 
digested with the restriction enzymes Hindi and EcoRI. The ligation mixture was then used to transfect 

45 E.coli XL1-Blue (Stratagene Cloning Systems, San Diego, CA). Single stranded template DNA was then 
prepared from mature bacteriophage particles derived from several independent plaques and then was 
analysed by DNA sequencing. A clone in which a linker of the expected sequence had been correctly 
inserted into the vector was designated pDBDFI (Fig. 7). This plasmid was then digested with Pstl and 
EcoRI and the approx. 0.24kb fragment was purified and then ligated with the 1 .29kb Bam Hl-Pstl fragment 

50 oTpDBD2 (Fig. 7) and Bam HI + Eco RI digested pUC19 (Yanisch-Perron, et al., 1985) to form pDBDF2 (Fig. 

7) . 

A plasmid containing a DNA sequence encoding full length human fibronectin, pFHDELI, was digested 
with Eco RI and Xho l and a 0.77kb Eco Ri-xhol fragment (Fig. 8) was isolated and then ligated with Eco RI 
and SaTdigeste(TM13 mp18 (Norrander etaC 1983) to form pDBDF3 (Fig. 8). ~~ 
55 The following oligonucleotide linker (Linker 6) was synthesised, representing from the Pstl site at 4784- 
4791 of the fibronectin sequence of EP-A-207 751 to the codon for tyrosine 1578 (Fig. 5) which is followed 
by a stop codon (TAA), a Hindlll site and then a BamHl cohesive end: 



7 



EP 0 399 666 A1 



Linker 6 



GPDQTEMTI EGL 
GGT CCA GAT CAA ACA GAA ATG ACT ATT GAA GGC TTG 
A CGT CCA GGT CTA GTT TGT CTT TAC TGA TAA CTT CCG AAC 



Q 


P 


T 


V 


E 


Y 


Stop 


CAG 


ccc 


ACA 


GTG 


GAG 


TAT 


TAA 


GTC 


GGG 


TGT 


CAC 


CTC 


ATA 


ATT 



20 

This linker was then ligated with Pstl and Hindlll digested pDBDF3 to form pDBDF4 (Fig. 8). The 
following DNA fragments were then ligated together with Bglll digested pKV50 (EP-A-258 067) as shown in 
Fig. 8: 0.68kb EcoRI- Bam HI fragment of pDBDF4, 1.5kb BamHl-StuI fragment of pDBDF2 and the 2.2kb 
Stul-EcoR! fragment of pFHDELI . The resultant plasmid pDBDFS (Fig. 8) includes the promoter of EP-A- 

25 258 067 to direct the expression of the HSA secretion signal fused to DNA encoding amino acids 1-387 of 
mature HSA, in turn fused directly and in frame with DNA encoding amino acids 585-1578 of human 
fibronectin, after which translation would terminate at the stop codon TAA. This is then followed by the 
S.cerevisiae PGK gene transcription terminator. The plasmid also contains sequences which permit 
selection and maintenance in Escherichia coli and S.cerevisiae (EP-A-258 067). 

30 This plasmid was introduced into S.cerevisiae S1 50-2B ( leu2-3 l eu2-112 ura3-52 trpl-289 his3- 1_) by 
standard procedures (Beggs s 1978). Transformants were subsequently analysed and found to produce the 
HSA-fibronectin fusion protein. 



35 EXAMPLE 2 : HSA 1-195 FUSED TO Fn 585-1578 

In this second example the first domain of human serum albumin (amino acids 1-195) is fused to amino 
acids 585-1578 of human fibronectin. 

The plasmid pDBD2 was digested with BamHI and Bglll and the 0.79kb fragment was purified and then 
40 Hcjated with BamHI-digested M13mp19 to form pDBDF6 (Fig. 6). The following oligonucleotide: 
5 -C C A A AGC TCGAGGAACTTC G-3' 

was used as a mutagenic primer to create a Xhol site in pDBDF6 by in vitro mutagenesis using a kit 
supplied by Amersham International PLC. This site was created by changing base number 696 of HSA from 
a T to a G (Fig. 2). The plasmid thus formed was designated pDBDF7 (Fig. 9). The following linker was then 
45 synthesised to represent from this newly created Xho l site to the codon for lysine 195 of HSA (AAA) and 
then from the codon for isoleucine 585 of fibronectin to the ends of oligonucleotides 1 and 8 shown in Fig. 
6. 



so Linker 7 



55 
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del'rdegkassak 
tc gat gaa ctt cgg gat gaa ggg aag gct tcg tct gcc aaa 
a ctt gaa gcc cta ctt ccc ttc cga agc aga cgg ttt 



ITETPSQPNSH 
ATC ACT GAG ACT CCG AGT CAG C 

TAG TGA CTC TGA GGC TCA GTC GGG TTG AGG GTG G 



This linker was ligated with the annealed oligonucleotides shown in Fig. 3, i.e. 2+7, 3 + 6 and 4 + 5 
together with Xhol and EcoRI digested pDBDF7 to form pDBDF8 (Fig. 9). Note that in order to recreate the 
original HSA~DNA sequence, and hence amino acid sequence, insertion of linker 7 and the other 
oligonucleotides into pDBDF7 does not recreate the Xho l site. 

The 0.83kb BamHi-StuI fragment of pDBDF8 was purified and then was ligated with the 0.68kb EcoRI- 
BamHI fragmentofpDBDF2 and the 2,22kb Stul-EcoRI fragment of pFHDELI into Bglll-digested pKV50 to 
form pDBDF9 (Fig. 9). This plasmid is similar to pDBDFS except that it specifies only residues 1-195 of 
HSA rather than 1-387 as in pDBDF5. 

When introduced into S.cerevisiae S150-2B as above, the plasmid directed the expression and 
secretion of a hybrid molecule composed of residues 1-195 of HSA fused to residues 585-1578 of 
fibronectin. 

EXAMPLE 3 : HSA 1-387 FUSED TO Fn 585-1578, AS CLEAVABLE MOLECULE 

In order to facilitate production of large amounts of residues 585-1578 of fibronectin, a construct was 
made in which DNA encoding residues 1-387 of HSA was separated from DNA encoding residues 585-1578 
of fibronectin by the sequence 



I E G R 
ATT GAA GGT AGA 
TAA CTT CCA TCT 



which specifies the cleavage recognition site for the blood clotting Factor X. Consequently the purified 
secre ted product can be treated with Factor X and then the fibronectin part of the molecule can be 
separated from the HSA part. 

To do this two oligonucleotides were synthesised and then annealed to form Linker 8. 

Linker 8 
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20 



TTG AGG GTG G 



This linker was then ligated with the annealed oligonucleotides shown in Fig. 6, i.e. 2 + 7, 3 + 6 and 4 + 5 
into Hindi andEcoRI digested mH0B12, to form pDBDF10 (Fig. 7). The plasmid was then digested with Pstl 
and Eco RI and the roughly 0.24kb fragment was purified and then ligated with the 1.29kb Bam Hi-Pstl 
fragment of pDBD2 and Bam Hi and Eco RI digested pUC19 to form pDBDFIl (Fig. 10). — "~ — 

The 1.5kb Bam Hi-StuI fragment of pDBDF11 was then ligated with the 0.68kb EcoRI-BamH1 fragment 
of pDBDF4 and the 2.22kb Stul-EcoRI fragment of pFHDEU into Bglll-digested pKV50 toTorm pDBDF12 
(Fig. 10). This plasmid was then introduced into S.cerevisiae S150-2B. The purified secreted fusion protein 
was treated with Factor X to liberate the fibronectin fragment representing residues 585-1578 of the native 
molecule. 
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Claims 

1. A fusion polypeptide comprising, as at least part of the N-terminal portion thereof, an N-terminal 
portion of HSA or a variant thereof and, as at least part of the C-terminal portion thereof, another 
polypeptide except that, when the said N-terminal portion of HSA is the 1-n portion where n is 369 to 419 or 
a variant thereof then the said polypeptide is (a) the 585 to 1578 portion of human fibronectin or a variant 
thereof, (b) the 1 to 368 portion of CD4 or a variant thereof, (c) platelet derived growth factor or a variant 
thereof, (d) transforming growth factor fi or a variant thereof, (e) the 1-261 portion of mature human plasma 
fibronectin or a variant thereof, (f) the 278-578 portion of mature human plasma fibronectin or a variant 
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thereof, (g) the 1-272 portion of mature human von WillebrancTs Factor or a variant thereof, or (h) alpha-1- 
antitrypsin or a variant thereof. 

2. A fusion polypeptide according to Claim 1 additionally comprising at least one N-terminal amino acid 
extending beyond the portion corresponding to the N-terminal portion of HSA. 
5 3. A fusion polypeptide according to Claim 1 or 2 wherein there is a cleavable region at the junction of 
the said N-terminal or C-terminal portions. 

4. A fusion polypeptide according to any one of the preceding claims wherein the said C-terminal 
portion is the 585 to 1578 portion of human plasma fibronectin or a variant thereof. 

5. A transformed or transfected host having a nucleotide sequence so arranged as to express a fusion 
10 polypeptide according to any one of the preceding claims. 

6. A process for preparing a fusion polypeptide by cultivation of a host according to Claim 5, followed 
by separation of the fusion polypeptide in a useful form. 

1. A fusion polypeptide according to any one of Claims 1 to 4 for use in therapy. 



11 



EP 0 399 666 A1 



FIGURE 1 

20 



asp Ala His Lys Ser Glu Val Ala His Arg Phe Lys Asp Leu Gly Glu Glu Asn Phe Lys 

30 40 
Ala Leu Val Leu He Ala Phe Ala Gin Tyr Leu Gin Gin Cys Pro Phe Glu Asp His Val 

Lys Leu Val Asn Glu Val Thr Glu Phe Ala Lys Thr Cys Val Ala Asp Glu Ser Ala Glu 

70 80 
Asn Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp Lys Leu Cys Thr Val Ala Thr Leu 

Arg Glu Thr Tyr Gly Glu Met Ala Asp Cys Cys Ala Lys Gin Glu Pre Glu Arg Asn Glu 

110 120 
Cys Phe Leu Gin His Lys Asp Asp Asn Pro Asn Leu Pro Arg Leu Val Arg Pro Glu Val 

130 I 40 
Asp Val Met Cys Thr Ala Phe His Asp Asn Glu Glu Thr Phe Leu Lys Lys Tyr Leu Tyr 

150 160 
Glu He Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe Ala Lys Arg 

170 

Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gin Ala Ala Asp Lys Ala Ala Cys Leu Leu Pro 

190 200 
Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gin Arg Leu Lys Cys 

210 f° 
Ala Ser Leu Gin Lys Phe Gly Glu Arg Ala Phe Lys Ala Trp Ala Val Ala Arg Leu Ser 

230 240 
Gin Arg Phe Pro Lys Ala Glu Phe Ala Glu Val Ser Lys Leu Val Thr Asp Leu Thr Lys 

250 i 260 

Val His Thr Glu Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp Arg Ala Asp Leu 

™ „ 2 *° 

Ala Lys Tyr He Cys Glu Asn Gin Asp Ser He Ser Ser Lys Leu Lys Glu Cys Cys Glu 

■ 290 300 
Lys Pro Leu Leu Glu Lys Ser His Cys He Ala Glu Val Glu Asn Asp Glu Met Pro Ala 

310 320 
Asp Leu Pro Ser Leu Ala Ala Asp Phe Val Glu Ser Lys Asp Val Cys Lys Asn Tyr Ala 

330 340 
Glu Ala Lys Asp val Phe Leu Gly Met Phe Leu Tyr Glu Tyr Ala Arg Arg His Pro Asp 

350 360 
Tyr Ser Val Val Leu Leu Leu Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu Glu Lys Cys 



380 



!370 
Cvs Tyr Ala Lys Val Phe Asp Glu Phe Lys Pro Leu 
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FIGURE 1 Cont. 

— 5^5 - 400 

Val Glu Glu Pro Gin Asn Leu lie Lys Gin Asn Cys Glu Leu Phe Glu Gin Leu Gly Glu 



Tyr Lys Phe 'Gin Asn Ala Leu Leu Val Arg Tyr Thr Lys Lys Val Pro Gin Val Ser 



420 
Thr 



430 440 
Pro Thr Leu Val Glu Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys Cys Lys His 

450 460 
Pro Glu Ala Lys Arg Met Pro Cys Ala Glu Asp Tyr Leu Ser Val Val Leu Asn Gin Leu 

470 480 
Cys Val Leu His Glu Lys Thr Pro Val Ser Asp Arg Val Thr Lys Cys Cys Thr Glu Ser 

490 500 
Leu Val Asn Arg Arg Pro Cys Phe Ser Ala Leu Glu Val Asp Glu Thr Tyr Val Pro Lys 

510 520 
Glu Phe Asn Ala Glu Thr Phe Thr Phe His Ala Asp lie Cys Thr Leu Ser Glu Lys Glu 

530 540 
Arg Gin lie Lys Lys Gin Thr Ala Leu Val Glu Leu Val Lys His Lys Pro Lys Ala Thr 

550 560 
Lys Glu Gin Leu Lys Ala Val Met Asp Asp Phe Ala Ala Phe Val Glu Lys Cys Cys Lys 

570 580 
Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu Glu Gly Lys Lys Leu Val Ala Ala Ser Gin 

Ala Ala Leu Gly Leu 
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FIGURE 2 DNA sequence coding for mature HSA 



10 20 30 40 50 60 70 80 

GATGCACACAAGAGTGAGGTTGCTCATCGGTTTAAAGATTTGGGAGAAGAAAATTTCAAAGCCTTGGTGTTGATTGCCTT 
DAHKSEVAHRFKD&GEENFKALVLIAF 

90 100 110 120 130 140 150' 160 

TGCTCAGTATCTTCAGCAGTGTCCATTTGAAGATCATGTAAAATTAGTGAATGAAGTAACTGAATTTGCAAAAACATGTG 
AQYLQQCPFEDHVKLVNEVTEFAKTC 

17 0 180 190 200 210 220 230 240 

TTGCTGATGAGTCAGCTGAAAATTGTGACAAATCACTTCATACCCTTTTTGGAGACAAATTATGCACAGTTGCAACTCTT 
VADESAENCDKSLHTLFGDKLCTVATL 

250 260 270 280 290 300 310 320 

CGTGAAACCTATGGTGAAATGGCTGACTGCTGTGCAAAACAAGAACCTGAGAGAAATGAATGCTTCTTGCAACACAAAGA 
RETYGEMADCCAKQEPERNECFLQHKD 

330 340 350 360 370 380 390 400 

TGACAACCCAAACCTCCCCCGATTGGTGAGACCAGAGGTTGATGTGATGTGCACTGCTTTTCATGACAATGAAGAGACAT 
DNPNLPRLVRPEVDVMCTAFHDNEET 

410 420 430 440 450 460 470 480 

TTTTGAAAAAATACTTATATGAAATTGCCAGAAGACATCCTTACTTTTATGCCCCGGAACTCCTTTTCTTTGCTAAAAGG 
FLKKYLYE IARRHPYFYAPELLFFAKR 

490 500 510 520 530 540 550 560 

TATAAAGCTGCTTTTACAGAATGTTGCCAAGCTGCTGATAAAGCTGCCTGCCTGTTGCCAAAGCTCGATGAACTTCGGGA 
YKAAFTECCQAADKAACLLPKLDELRD 

570 580 590 600 610" 620 630 640 

TGAAGGGAAGGCTTCGTCTGCCAAACAGAGACTCAAATGTGCCAGTCTCCAAAAATTTGGAGAAAGAGCTTTCAAAGCAT 
EGKASSAKQRLKCASLQKFGERAFKA 

650 660 670 680 690 700 710 720 

GGGCAGTGGCTCGCCTGAGCCAGAGATTTCCCAAAGCTGAGTTTGCAGAAGTTTCCAAGTTAGTGACAGATCTTACCAAA 
WAVARL S QRFP KAEFAEV S KLVTD LTK 

730 740 750 760 770 780 790 800 

GTCCACACGGAATGCTGCCATGGAGATCTGCTTGAATGTGCTGATGACAGGGCGGACCTTGCCAAGTATATCTGTGAAAA 
VgTECCHGDLLE CADDRADLAKYICEN 

810 820 830 840 850 860 870 \ 880 

TCAGGATTCGATCTCCAGTAAACTGAAGGAATGCTGTGAAAAACCTCTGTTGGAAAAATCCCACTGCATTGCCGAAGTGG 
QDS ^SSKLKECCEKPLLEKSHCIAEV 

890 900 910 920 930 940 950 960 

AAAATGATGAGATGCCTGCTGACTTGCCTTCATTAGCTGCTGATTTTGTTGAAAGTAAGGATGTTTGCAAAAACTATGCT 
END EMPADLPSLAADFVES KDVCKNYA 

970 980 990 1000 1010 1020 1030 1040 

GAGGCAAAGGATGTCTTCCTGGGCATGTTTTTGTATGAATATGCAAGAAGGCATCCTGATTACTCTGTCGTGCTGCTGCT 
EAKDVFLG MFLYEYARRHPD YSVVLLL 
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FIGURE 2 Cont . 

1050 1060 1070 1080 1090 1100 1110 1120 

GAGACTTGCCAAGACATATGAAACCACTCTAGAGAAGTGCTGTGCCGCTGCAGATCCTCATGAATGCTATGCCAAAGTGT 
RLAKTYETTLEKCCAAADPHECYAKV 

1130 1140 1150 1160 1170 1180 1190 1200 

TCGATGAATTTAAACCTCTTGTGGAAGAGCCTCAGAATTTAATCAAACAAAACTGTGAGCTTTTTGAGCAGCTTGGAGAG 
F D E F K P L v'EEPQNI.'r'K QNCELFEQLGE 

1210 1220 1230 1240 1250 1260 1270 « 1280 

TACAAATTCCAGAATGCGCTATTAGTTCGTTACACCAAGAAAGTACCCCAAGTGTCAACTCCAACTCTTGTAGAGGTCTC 
YKFQNALLVRYTKKVPQVSTPTLVEVS 

1290 1300 1310 1320 1330 1340 1350 1360 

AAGAAACCTAGGAAAAGTGGGCAGCAAATGTTGTAAACATCCTGAAGCAAAAAGAATGCCCTGTGCAGAAGACTATCTAT 

RNLGKVGSKCCKHPEAKRMPCAEDYL 

1370 1380 1390 1400 1410 1420 1430 1440 

CCGTGGTCCTGAACCAGTTATGTGTGTTGCATGAGAAAACGCCAGTAAGTGACAGAGTCACAAAATGCTGCACAGAGTCC 
SVVLNQL CVLHEKTPVSDRVTKCCTES 

1450 1460 1470 1480 1490 1500 1510 1520 

TTGGTGAACAGGCGACCATGCTTTTCAGCTCTGGAAGTCGATGAAACATACGTTCCCAAAGAGTTTAATGCTGAAACATT 
LVNRRPCFSALEVDETYVPKEFNAETF 

1530 1540 1550 1560 1570 1580 1590 1600 

CACCTTCCATGCAGATATATGCACACTTTCTGAGAAGGAGAGACAAATCAAGAAACAAACTGCACTTGTTGAGCTTGTGA 
TFHADICTLSEKERQIKKQTAL7ELV 

1610 1620 1630 1640 1650 1660 1670 1680 

AACACAAGCCCAAGGCAACAAAAGAGCAACTGAAAGCTGTTATGGATGATTTCGCAGCTTTTGTAGAGAAGTGCTGCAAG 
KHKPKATKEQLKAVMDDFAAFVEKCCK 

1690 1700 1710 1720 1730 1740 1750 1760 

GCTGACGATAAGGAGACCTGCTTTGCCGAGGAGGGTAAAAAACTTGTTGCTGCAAGTCAAGCTGCCTTAGGCTTATAACA 
ADDKETCFAEE GKKLVAASQAALGL 

1770 1780 
TCTACATTTAAAAGCATCTCAG 
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FIGURE 3 Construction of tnH0Bl6 





Digest 
with 
Hindlll 
+ PstI 



He 

Linker l 



Ligate 
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Hindlll- 
PscI 



I pDBDl U_ 




°igest 

an <* Biaa- 



B = BamHI; E = EcoR l; H - Hindl ll ; 
He = HiacI I; P - PstI; X - Xhol 
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FIGURE 4 Construction of pH0B31 
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Gin Ala Gin Gin net 
Cys Tyr Asp Asn Gly 
Asn Val Leu Val Cys 
Glu Ala Glu Glu Thr 
Tyr Glu Arg Pro Lys 
Arg He Ser Cys Thr 
Asp Thr Trp Arg Arg 
Asn Gly Lys Gly Glu 
Gly Thr Ser Tyr Val 
Asp Cys Thr Cys Leu 
Asn Asp Gin Aso Thr 
Arg Sly Asn Leu Leu 
Arg His Thr Ser Val 
Ala Val Tyr Sin Pro 
Ser Sly Val 'Val Tyr 
Leur Cy* Thr Cys 1_eu 
Gly Sly As** Ser- Asn 
Tyr 5*2: Cys ,THr Thr. 
Tyr* Glu - Sin -Asp Sin. 
iSly Gly- Asn* Ser Asn. 
Tbr Asp: CVs Thr Ser 



Val 


Gin 


' Pro 


Gin 


lO 
Ser 


Lys 


His 


Tyr 


Gin 


30 

lie 


Thr 


Cys 


Tyr 


Gly 


50 
Gly 


Cys 


Phe 


Asp 


Lys 


70 
Tyr 


Asp 


Ser 


net 


He 


90 
Trp 


lie 


Al a 


Asn 


Arg 


110 
Cys 


Pro 


Hi s 


Glu 


Thr 


ISO 
Gly 


Trp 


Thr 


Cys 


Lys 


1 50 
Pro 


Val 


Gly 


Glu 


Thr 


170 
1 r*p 


Gly 


Glu 


Gly 


Ser 


190 
Gly 


Arg 


Thr 


Ser 


Tyr 


210 
Arg 


Gin 


Cys 


He 


Cys 


230 
Thr 


Gin 


Thr 


Thr 


Ser 


250 
Ser 


Sin 


Pro 


His 


Pro 


270 
Gin 


Ser 


v*r 


Gl y 


/let . 


290 • 
.Bin 


Gly 


Asn 


Sly 


Val 


3X0 
Ser 


Sly 


Glu 


Pro 


Cys 


330 
Val 


Glu Gly 


Arg 


Gin 


350 
Asp 


Lys 


Tyr 


Ser Phe 


370 
Cys 


Sly Ala Leu- Cys. His 


siu 


Gly Arg 


Arg 


4T0" 
Asp 



Pro Val Ala Val Ser 
Asn Gin Gin Trp Glu 
Ser Arg Gly Phe Asn 
Thr Gly Asn Thr Tyr 
Asp Cys Thr Cys He 
Hi s Glu Gly Gl y 'Gin 
Gly Tyr Met Leu Glu 
He Ala Glu Lys Cys 
Glu Lys Pro Tyr Gin 
Arg lie Thr Cys Thr 
He Gly Asp Thr Trp 
Gly Asn Sly Arg Gly 
Gly Ser Gly Pro Phe 
Pro Pro Pro Tyr Gly 
Trp Leu. Lys Thr Gin 
Cys Gin Glu Thr Ala 
Leu Pro- Phe Thr Tyr 
Gly His- Leu Trp Cys 
Thr Asp. His Thr Val 
Phe Pro Phe Leu Tyr 
Asrr net Lys Trp Cys 



Gin 


Ser 


Lys 


Pro 


20 
Gly 


Arg 


Thr 


Tyr 


Leu 


40 
Gly 


Cys 


Glu 


Ser 


Lys 


60 
Pro 


Arg 


Val 


51 y 


Asp 


SO 
Thr 


Gly 


Ala 


Gly 


Arg 


1 00 
Gly 


Ser 


Tyr 


Lys 


lie 


1 20 
Gly 


Cys 


Val 


Cys 


Leu 


1 40 
Gly 


Phe 


Asp 


His 


Ala 


1 60 
Ala 


Gly 


Trp 


Met 


Met 


Val • 


Ser 


Arg 


Asn 


Arg 


Cys 


Ser 


Lys 


Lys 


Aso 


Asn 


Glu 


Trp 


Lys 


Cys 


Glu 


Thr 


Asp 


Val 


Arg 


Ala 


His 


Cys 


Val 


Thr 


2SO 
Asp 


Gly 


Asn 


Lys 


Gin 


300 
net 


Val 


Thr 


Sin 


Thr 


320 
Tyr 


Asn 


Gly 


Arg 


Thr 


3AO 
Phe 


Ser 


Thr 


Thr 


Ser 


360 
Asn 


Leu 


Val 


Sin 


Thr 


380 
Gin 


Asn 


Asn 


His 


Asn 


AOO 
Tyr 


Gly 


Thr 


Thr 


GTn 


420 
A'sn 



•FIGURE 5A 
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•FNDEL.1 



43C. d4 0 
Tyr -ASp AT*-' Asp- -SIti Lys Phe Sly Phe "Cys ProTlet Ala Ala His Glu 61 u lie Cys Th 

- ^SO. 4£0 
-Tnr"- Aw* Siu "Sly ' v *l "et Tyr Acg Tie Gly Asp tain Trp Asp Lys Gin His Asp .net Gl y 

470 *SO 
•"■His' net ttet"7*rg Cys Thr Cys Val Gly Asn Gly Arg Gly Glu Trp Thr Cys Tyr Ala Tyr 

490 SCO 
Ser STn'Leu Arg Asp Gin Cys lie Ual Asp Asp lie Thr Tyr Asn Val Asn Asp Thr Phe 

■ SIO _ 320 

His "Lys Arg' Hi's Glu Glu Gly His Met Leu Asn Cys Thr Cys Phe Gly Gin y Arg Gly 

530- * • 54 O 

Arg Trp Lys Cys Asp Prq .V*i Asp. Gin Cys GLn...Asp Ser Glu-.Thr Gly Thr Phe Tyr Gin 

. * * 560 

Lie Sly Asp .Ser Trp Glu Lys . Tyr.. Val .Hi_*. Gl y Val Arg Tyr Gin Cys Tyr Cys Tyr Gly 

" • STO* _ 530 

Arg Sly £L*_Sly Glu .Trp Kis Cys. .GLa. Pro -Leu Gin Thr Tyr Pro Ser Ser Ser Gly Pre 

Val Siu . VaJ--rtve He Thr Glu Thr Pro Ser -Gin -Pro Asn Ser His Pro lie Gin Trp Asn 

Ala .Pro JS^./^o Ser His lie Ser Lys Ty£ He Leu Arg Trp Arg Pro Lys Asn Ser Si? 

630 _ _ o-O 

.Sly ^r* Trp-Ly^rSiu Ala Thr lie Pro .Sly His Leu Asn Ser i/r Thr lie Lys Gly Leu 

Lso &*a0 

Lys Pro Sly v V*i v*l Tyr- Glu Gly Gin Leu He Ser II* Gin Sin Tyr Gly His Gin Glu 
-Va.i-.~hrr J=c-g Phe Asp Pne Thr Thr Tnr Thr Ser Thr Pro V a ; Thr Ser Asn Thr Zli 

Thr. Gly -SUi Tnr-Thr Pro Phe Ser Pro llZ Val Ala Tnr Ser Siu Ser V*i Thr Glu he 

Thr Ala Ser" Ser Phe V*i V*i Ser Trp Vai Ser Al a Ser Asp Thr Vai Ser Giy Phe Arg 

730 7 *° 
Val Siu Tyr SI U Leu Ser Glu Glu Gly Asp Glu ?r? Gin Tyr Leu Asp Leu rrc Ser ihr 

Ala Thr Ser ^ai Asn lie Pro Asp L*u LeS Pro Sly Arg Lys Tyr tie V ai Asn Val Ty". 



r 



Sin He Ser 6J u Asc Sly 51" Gin Ser- Leu tie Leu Ser Tnr Ser Sin Thr ~nr Ala Pre 
•Asp Alt P-o — " esc ?re T.tr v ai Aso Sin Vai Asp Ase Thr Ser Val v»> ?r C ' 



810 a ~ f ' 



Ser Ar- 5 ?r, Sir. Al* Pre He Tnr Gly Tyr Ar 9 11. v*i Tyr Ser P-c Ser v,; 3 lw Gl 
Ser Ser Tnr Glu Leu Aen Leu C-a Glu Thr" Ai * A«* 5e- V»: Th- Leu Se- Leu 
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930 

Pro Gly Val Gin Tyr Asn lie Thr He Tyr Ala Vai Glu Glu Asn Gin Slu Ser Thr Pro 

870 * sao 

Val Val lie Gin Gin Glu Thr Thr Gly Thr Pro Arg Ser Asp Thr Vai Pro Ser Pro Arg 

890 

Asp Leu Gin Phe Val Glu Val Thr Asp Vai Lys Val Thr He Met Trp Thr Pro Pro Glu 

910 920 

Ser Ala Val Thr Gly Tyr Arg Val Asp Vai lie Pro Val Asn Leu Pro Gly Glu His Gly 

930 940 

Sin Arg Leu Pro lie Ser Arg Asn Thr Phe Ala Glu Val Thr Gly Leu Ser Pro Gly Vai 

950 9 *>0 
Thr Tyr Tyr phe Lys Val Phe ALa Val Ser His Gly Arg Glu Ser Lys Pro Leu « hr Ala 

970 2 9 ° 
Gin Gin Thr Thr Lys Leu Asp Ala Pro Thr Asn Leu Gin Phe Val Asn Glu Thr Asp Ser 

990 1 000 

Thr Val Leu Val Arg Trp Thr Pro Pro Arg Ala Gin He Thr Gly Tyr Arg Leu Thr Val 

10 10 1020 

Gly Leu Thr Arg Arg Gly Gin Pro Arg Gin Tyr Asn Val Sly Pro Ser Val Ser Lys Tyr 

1030 10AO 

Pro Leu Arg Asn Leu Gin Pro Ala Ser Glu Tyr Thr Val Ser Leu Val Ala He Lys Gly 

1050 1060 

Asn Sin Glu Ser Pro Lys Ala Thr Gly Vai Phe Thr Thr Leu Gin Pro Gly Ser Ser lie 

io70 ioao 

Pro Pro Tyr Asn Thr Glu Val Thr Glu Thr Thr lie Vai tie Thr Trp i hr Pro Ala Pro 

1090 1100 

Arc-Jle Sly Phe Lys Leu Gly Val Arg Pro Ser Gin Gly Gly Glu Ala Pro Arg Glu Val 

1110 1 120 

Thr Ser Asp Ser Sly Ser lie Vai Val Ser Sly Leu Thr Pro Gly Vai Glu Tyr Val Tyr 

Thr lie Gl-n Vai Leu Arg Asp Gly- -SI.* Glu Arg Asp Ala Pro He Val Asn Lys Val Vai 

1150 1160 

Thr Pro Leu Ser Pro Pro Thr Asn Leu His Leu Glu Ala Asn Pro Asp Thr Gly Val Lsu 

1170 „ _ iiSO 

Thr Val Ser Trp Glu Arg Ser Thr Thr" Pro Asp He Thr Gly Tyr Arg lie inr Thr i hr 

1190 f200 

Pro Thr Asn Sly Sin Gin Gly Asn Ser Leu Glu Glu Val Val His Ala Asp Gin Ser Ser 

1210 _ 1220 

.Cys -HrThe-fts?' Asn Leo Ser Pro Gly Leu. Glu Tyr Asn Vai Ser Vai Tyr Thr Vai Lys 

1230 1240 

.AWvfcsp*Lys Glu Ser Vai Pro lie Ser Asp Thr He Urn Pro Ala Vai Pro Pro Pro Tnr 

1250 ^ 1260 t 

fisv L*u Arg Phe Thr Asn- lie Sly-Pro- Aw Thr net Arg Vai Thr Trp Ala Pro Pro Pro 
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.FNDEL.1 • 

• 1270 1290 

Ser -lie A*p Leu- Thr Asn .Phe Leu Vai Arg Tyr Ser Pro Vai Lys Asn GXu Slu Aso Vai- 

1 2^0 1 300 

Pi* Slu-l^^r/Tle Ser Pro Ser Asp Asn Ala Vai. Vai Leu Thr Asn Leu Leu Pro Gly 

Thr-Sl-a-Tyr Vai Vai Ser VaV Ser Sv^al Tyr Glu Sin His Slu Ser Thr Pro Leu flrg 

fc330 * 340 
Sir Arg Sin Lys Thr Sly Lw'fep Ser'Pro-Thr Gly lie Asp Phe Ser Asp He Thr Ala 

■ . '-TZZQ l3oO 

Asn S«r-FHe Thr Vai His Trp*-Tlw"*rar-Pra Arg Ala Thr lie Thr Gly Tyr Arg lie Arg 

-1370 1380 

ms-Hrs-Pro Glu His Phe 3er BlV&rg Pro Arg Glu Asp Arg Vai Pro His Ser Arc Asn 

* * 1390 1400 

-Ser lie Thr Leu Thr Asn L^' Thr Pro 61 y Thr Glu Tyr Vai Vai Ser He Vai Ala Leu 

1410 1520 

Asn Gly Arg Glu Glu Ser Pro Leu Leu II. Gly Gin Gin Ser Thr Vai Ser Asp Vai rro 

1430 1*40 

Arg Asp Leu Slu Vai Vai Ala Ala Thr Pro Thr Ser Leu Leu lie Ser T.-p Asp Ala Pro 

liiSO 1460 

Ala Vai" Thr Vai Ar 5 Tyr Tyr Arg lie Thr Tyr Gly Slu Thr Sly Sly Asn Ser Pro Vai 

1470 1430 

Sin SLu Phe Thr Vai Pro Gly Ser Lys Ser Thr Ala Thr lie Ser Gly Leu Lys Pre Gly 

" * . 1 A <9C 1 500 

v*i .Asp Tyr Thr lie Thr Vai Tyr Ala'vai Thr Gly Arc Gly Aso S«r Pro Ala Ser Ser 

Ly? Pro' 11 m Ser He Asn Tyr Arg Thr'liu lie Asp Lys «r« Ser Gin nee Sin u*i*¥n- 



*40 



Tyr Arg Vai Thr Thr Thr Pro Lys Asn Siv Pre Gly Pre Thr Lys Tnr Lys Thr Ala 31 



Asp Vai Gin Aso Asn Ser lie Ser Vai *CyS Trp Leu P-c Ser Ser Ser Pro Vai Thr Gly 

1330 _ . A , 

Pre Aso Sir- Thr Glu M« Thr lie Qli/lly Leu Gin Pre Tnr Vai 61 u Tyr vai v *-'^r 

vai Tvr Glr. Asn Pr 3 Ser Gly Siller* Sin =-c Leu vai Gin Th- Ala Vai Thr' Thr 

He *~ Ala Pre Tnr Ase Leu Lys Phe 'thr Glr Vai Thr »ro Thr Ser Leu Ser Al/stn 

Tr - r-r Vc P-e fi»r. val Gin Leu Tn^G^C Tvr A-c Vai Arg vai Th- P-o uys bl- -v» 

7»r Sly Pre n e -. «_y, Slu lie A»r. L.uAla P"3 Ser Se- Ser vai Vai Vai 5.- Sly 

Leu nee va; Ala Tnr ._y, Tv- Sic Val'l^ Vai 7v- 41* -eu -v, T.->- ^'^ 



' FXGTJRE 5D 
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14*90 - -1.700 

Am Pro Ala Sin Gly Val Val Thr Thr Leu Glu ten Val Ser Pro Pro *-g Arg Ala Arg 

" 1710 • _, * 720 

V»i .T7v--A»..Ala Thr.-Glu Thr Thr lie Thr .LI. .Ser Trp Arg Thr Lys Thr Glu Thr It* 

1730 1740 

Thr Gly'Phe Bin VaI Asp Ala Val Pro Ala Asn Gly Sin Thr Pro He Sin Arg Thr lis 

17SO _ l7i0 

I_v3 Pro Asp' Val Arg Ser Tyr Thr lie Thr Gly Leu, Gin Pro Gly Thr Asp Tyr Lys He 

1770 1780 
Tvr-. Leu Tyr Thr Leu Asn Asp Asn Ala Arg Ser Ser Pro Val Val He Asp Ala Ser Thr 

1790 1800 
Ala He Asp Ala Pro Ser Asn Leu Arg Phe Leu Ala Thr Thr Pro Asn Ser Leu Leu Ual 

Ser Trp Gin Pro Pro Arg Ala Arg Ile'fhr Gly Tyr He He Lys Tyr Slu Lys Pro^Fy 

1 S30 1 940 

Ser Pro Pro Arg Glu Val Val Pro Arg Pro Arg Pre Gly Val Thr Glu AU Thr He .hr 

i 9S o 1 360 

Gly Leu Glu Pro Gly Thr Glu Tyr Thr He Tyr Val He Ala L-u Lys Asn Asn Gin Lys 

Ser Glu Pro Leu He Gly Arg Lys Lys ' fir Aso Glu Leu Pro Gin Leu Val Thr Leu' Pro 

His Pro Asn Leu His Gly Pro Glu II. TO Asp Val Pro Ser Thr Vai Gin Lys Thr'p'ro 

tPIO - 1^20 

Phe Val Thr His Pro Gly Tyr Asp Thr Sly Asn Gly He Gin Leu Pro Gly Thr S-r 

Sin Gin Pro Ser Val Gly Gin Gin flet'lfS Phe Slu Glu His Gly Phe Arg Arg Th ^'^ 

Pro Pro Thr Thr Ala. Thr Pro. He Arg 1 "? Ar, Pro *rg Pro Tyr Pro Pro Asn Vai 'ttl 
• * - - * 1980 

Leu. Ser. Gin Thr Thr He .Ser Trp Ala/™ Phe Sin -Asp Thr- Ser Glu Tyr He II. Ser 

i — 2000 

Cya^.Pi-cVal Gly. Thr Asp Glu Glu Pro Leu Sin Phe Arg Val Pro Gly i hr <~V^ 

Ser Al^-Xhr- Leu.Thr Gly Leu Thr Arg^ly M+yThr Tyr- Asn He He Val Glu Aia^S 

.Lys-.Asp-Sln Sin Arg- His Lys Vai Arg 2 l!u Slu- Val Val Thr Val Gly As. Ser Val^j 

& u-..Sly -L^u Asn Sl.-Pro Thr Asp Ashler Cys Phe Asp Pro Tyr Thr Val Ser His^r 

Al^Val. Sly Asp Slu Trp Glu Arg (letter Slu Ser Gly Phe Lys Leu Leu Cys Gln'Cys 

*..«r..SlS^.. Gl^.Ser:Siy^ S Ph* Arg 2 Sys Asp Ser Ser Arg Trp Cys His Asp Asn 2 i?y 



JTGTJRE 5E 
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FN DELI 

-7110 - 2120 

Vai-'-fti^-Tyn-Ly* He GlyGlu Lys Trp~Asp Arg Sin Sly Sl^ Asn Sly Gin rtet tt« Ser 

2l3Q - 2140 

Cv-s'Ttur Cvs Lau 61y Ain'G^ tys Sly GIu Pbe Lys Cys Asp Pro His Gl u Ala Thr Cys 

" 21SO - 21^0 

[Tyr. *»~ ftsp 51yL>^ Thr! Tyr HfeVal Sly SI a -Gin Trp Gin Lys 61 u Tyr L*u Sly Ala 

21 70 * * 2 ISO 

rt3^ S«:~C** "Sftri-Cys ; Ph» Sly Sly Sin Arg Sly Trp Arg Cy-s Asp Asn Cys Arg Arg 

- ' ~ 2 1 - • 2200 

ffebr. S*y Sly::Gl«-.Pra. S^. Pro. GIU Gly . Ttoe Tnr Sy Sin S«r Tyr Asn Gin Tyr Ser Gin 

. 22 1C 2220 

- Ar^,l^ Hi3 Sin W'tte-Asn Thr Asn V*i Asn Cys* Pro He 81 u Cys Pn* net Pro Leu 

2230 

';Asp--ViLl- Sin Ala- Asp Arg 5Kr Asp Ser"Arg Glu 



• FIGURE 5F 
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Fig. 7 Construction of pDBDF2 
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Fig. 8 Construction of pDBDF5 
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• 9 Construction of pDBDF9 
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'Pstl-EcoRI fragment 




Fig. 10 Construction of pDBDF12 
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Figure 11 

Name: pFHDELl 

Vector: pUCIS Amp fv 2S60bp 

Insert: hFNcONA ~ 7530bp 
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